RAW SEQUENCE LISTING 



The Biotechnology Systems Branch of the Scientific and Technical 
Information Center (STIC) no errors detected. 

Application Serial Number: 

Source: P(J~/f0 

Date Processed by STIC: 



ENTERED 



- "75 



BEST AVAILABLE COPY 



RAW SEQUENCE LISTING DATE: 07/07/2005 

PATENT APPLICATION: US/10/509 , 773 TIME: 13:28:01 



Input Set : D:\seqlist.txt 

Output Set: N:\CRF4\07072005\J509773.raw 

3 <110> APPLICANT: Delaney, Allen 

5 <120> TITLE OF INVENTION: Cancer Associated Protein Phosphatases and the 

6 uses 

8 <130> FILE REFERENCE: SMAR-044 
10 <140> CURRENT APPLICATION NUMBER: 10/509,773 
C--> 11 <141> CURRENT FILING DATE: 2004-09-28 

13 <150> PRIOR APPLICATION NUMBER: CA03/00393 

14 <151> PRIOR FILING DATE: 2003-03-19 

16 <150> PRIOR APPLICATION NUMBER : 60/368,859 

17 <151> PRIOR FILING DATE: 2002-03-28 
19 <160> NUMBER OF SEQ ID NOS : 12 

21 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 1520 

25 <212> TYPE: DNA 

2 6 <213> ORGANISM: Homo sapiens 
28 <220> FEATURE: 

2 9 <221> NAME/KEY: misc_feature 

30 <222> LOCATION: (0)...(0) 

31 <223> OTHER INFORMATION: MKPX polynucleotide 

3 3 <400> SEQUENCE: 1 

34 ggcacgaggc cgagcctagt gcctcccacg cccggcggcc gcgagccggg gtccgcgagg 60 
3 5 gcggagtggg gcgcggcagc caggaacccg actacgaatc ccagggtgcg ggcgggcgga 120 
3 6 gcgaggaggg acgctgggcc tgcccggtgc gcacgggggc ggggaccggc aaggcgggac 180 

37 catttcccgg cataggctcc ggtgcccctg cccggctccc gccgggaagt tctaggccgc 240 

38 cgcacagaaa gccctgccct ccacgccggg tctctggagc gccctgggtt gcccggccgg 300 

39 tccctgccgc tgacttgttg acactgcgag cactcagtcc ctcccgcgcg cctcctcccc 360 

40 gcccgccccg ccgctcctcc tccctgtaac atgccatagt gcgcctgcga ccacacggcc 420 

41 ggggcgctag cgttcgcctt cagccaccat ggggaatggg atgaacaaga tcctgcccgg 480 

42 cctgtacatc ggcaacttca aagatgccag agacgcggaa caattgagca agaacaaggt 54 0 

43 gacacatatt ctgtctgtcc atgatagtgc caggcctatg ttggagggag ttaaatacct 600 

44 gtgcatccca gcagcggatt caccatctca aaacctgaca agacatttca aagaaagtat 660 

45 taaattcatt cacgagtgcc ggctccgcgg tgagagctgc cttgtacact gcctggccgg 72 0 

46 ggtctccagg agcgtgacac tggtgatcgc atacatcatg accgtcactg actttggctg 780 

47 ggaggatgcc ctgcacaccg tgcgtgctgg gagatcctgt gccaacccca acgtgggctt 840 

48 ccagagacag ctccaggagt ttgagaagca tgaggtccat cagtatcggc agtggctgaa 900 

49 ggaagaatat ggagagagcc ctttgcagga tgcagaagaa gccaaaaaca ttctggccgc 960 

50 tccaggaatt ctgaagttct gggcctttct cagaagactg taatgtacct gaagtttctg 1020 

51 aaatattgca aacccacaga gtttaggctg gtgctgccaa aaagaaaagc aacatagagt 1080 

52 ttaagtatcc agtagtgatt tgtaaacttg tttttcattt gaagctgaat atatacgtag 1140 

53 tcatgtttat gttgagaact aaggatattc tttagcaaga gaaaatattt tccccttatc 1200 

54 cccactgctg tggaggtttc tgtacctcgc ttggatgcct gtaaggatcc cgggagcctt 1260 

55 gccgcactgc cttgtgggtg gcttggcgct cgtgattgct tcctgtgaac gcctcccaag 1320 
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56 gacgagccca gtgtagttgt gtggcgtgaa ctctgcccgt gtgttctcaa attccccagc 1380 

57 ttgggaaata gcccttggtg tgggttttat ctctggtttg tgttctccgt ggtggaattg 1440 

58 accgaaagct ctatgttttc gttaataaag ggcaacttag ccaagtttaa aaaaaaaaaa 1500 

5 9 aaaaaaaaaa aaaaaaaaaa 1520 

62 <210> SEQ ID NO: 2 

63 <211> LENGTH: 184 

64 <212> TYPE: PRT 

65 <213> ORGANISM: Homo sapiens 
67 <220> FEATURE: 

6 8 <221> NAME /KEY : UNSURE 

69 <222> LOCATION: (0) . . . (0) 

70 <223> OTHER INFORMATION: MKPX polypeptide 
72 <400> SEQUENCE: 2 



73 


Met 


Gly 


Asn 


Gly 


Met 


Asn 


Lys 


He 


Leu 


Pro 


Gly Leu Tyr He 


Gly Asn 


74 


1 








5 










10 






15 


75 


Phe 


Lys 


Asp 


Ala 


Arg 


Asp 


Ala 


Glu 


Gin 


Leu 


Ser Lys 


Asn Lys 


Val Thr 


76 








20 










25 






30 




77 


His 


He 


Leu 


Ser 


Val 


His Asp 


Ser 


Ala 


Arg 


Pro Met 


Leu Glu Gly Val 


78 






35 










40 








45 




79 


Lys 


Tyr 


Leu 


Cys 


He 


Pro 


Ala 


Ala 


Asp 


Ser 


Pro Ser 


Gin Asn 


Leu Thr 


80 




50 










55 








60 






81 


Arg 


His 


Phe 


Lys 


Glu 


Ser 


He 


Lys 


Phe 


He 


His Glu 


Cys Arg 


Leu Arg 


82 


65 










70 










75 




80 


83 


Gly 


Glu 


Ser 


Cys 


Leu 


Val 


His 


Cys 


Leu 


Ala 


Gly Val 


Ser Arg 


Ser Val 


84 










85 










90 






95 


85 


Thr 


Leu 


Val 


He 


Ala 


Tyr 


He 


Met 


Thr 


Val 


Thr Asp 


Phe Gly Trp Glu 


86 








100 










105 






110 




87 


Asp 


Ala 


Leu 


His 


Thr 


Val 


Arg 


Ala 


Gly 


Arg 


Ser Cys 


Ala Asn 


Pro Asn 


88 






115 










120 








125 




89 


Val 


Gly 


Phe 


Gin 


Arg 


Gin 


Leu 


Gin 


Glu 


Phe 


Glu Lys 


His Glu 


Val His 


90 




130 










135 








140 






91 


Gin 


Tyr 


Arg 


Gin 


Trp 


Leu 


Lys 


Glu 


Glu 


Tyr 


Gly Glu 


Ser Pro 


Leu Gin 


92 


145 










150 










155 




160 


93 


Asp 


Ala 


Glu 


Glu 


Ala 


Lys 


Asn 


He 


Leu 


Ala 


Ala Pro Gly He 


Leu Lys 


94 










165 










170 






175 


95 


Phe 


Trp 


Ala 


Phe 


Leu 


Arg Arg 


Leu 












96 








180 





















99 <210> SEQ ID NO: 3 

100 <211> LENGTH: 2916 

101 <212> TYPE: DNA 

102 <213> ORGANISM: Homo sapiens 

104 <220> FEATURE: 

105 <221> NAME/KEY: misc_feature 

106 <222> LOCATION: (0) . . . (0) 

107 <223> OTHER INFORMATION: PTP4A1 polynucleotide- 

109 <400> SEQUENCE: 3 

110 aagggcgcct cggcgcgtgt attggctcct tcggctgcgg gccggctcgc ctacgcgctc 60 

111 tgctccgagc cgctcactgc atggtagagt ctggtgcccc cgccgccgcc tgcatcgccg 120 

112 ccaccgccgc tccgccacga ccaccgccgc ctccttgtcc tgcagccacc gccaccgcct 180 



file://C:\CRF4\Outhold\VsrJ509773.htm 



7/7/05 



RAW SEQUENCE LISTING DATE: 07/07/2005 
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113 gtgtcgccgc cgctcgggac cggctgtatg attaggccac aatcttcaat gagtaaacat 240 

114 attcctcaat tctgtggtgt tcttggtcac acatttatgg agtttctgaa gggcagtgga 300 

115 gattactgcc aggcacagca cgacctctat gcagacaagt gaactgtaga aactgattac 360 

116 tgctccacca agaagccccc ataagagtgg ttatcctgga cacagaagtg ttgaaatcca 420 

117 cagagcattt tacaagagtt ctgacctgga tggggtaaac ctcagtgcac ttcttttctg 480 

118 ttggcctcag tattactgga ttgaagaatt gctgcttctt gttaggaggt tcatttcact 540 

119 tatcattact tacaacttca tactcaaagc actgagaatt tcaagtggag tatattgaag 600 

120 tagacttcag tttctttgga tcatttctgt attcaatttt tttaattatt tcataaccct 660 

121 attgagtgtt ttttaactaa attaacatgg ctcgaatgaa ccgcccaget cctgtggaag 720 

122 tcacatacaa gaacatgaga tttcttatta cacacaatcc aaccaatgcg accttaaaca 780 

123 aatttataga ggaacttaag aagtatggag ttaccacaat agtaagagta tgtgaagcaa 840 

124 cttatgacac tactcttgtg gagaaagaag gtatccatgt tcttgattgg ccttttgatg 900 

125 atggtgcacc accatccaac cagattgttg atgactggtt aagtcttgtg aaaattaagt 960 

126 ttcgtgaaga acctggttgt tgtattgctg ttcattgcgt tgcaggcctt gggagagctc 1020 

127 cagtacttgt tgccctagca ttaattgaag gtggaatgaa atacgaagat gcagtacaat 1080 

128 tcataagaca aaagcggcgt ggagctttta acagcaagca acttctgtat ttggagaagt 1140 

129 atcgtcctaa aatgcggctg cgtttcaaag attccaacgg tcatagaaac aactgttgca 1200 

130 ttcaataaaa ttggggtgcc taatgctact ggaagtggaa cttgagatag ggcctaattt 1260 

131 gttatacata ttagccaaca tgttggctta gtaagtctaa tgaagcttcc ataggagtat 1320 

132 tgaaaggcag ttttaccagg cctcaagcta gacagatttg gcaacctctg tatttgggtt 1380 

133 acagtcaacc tatttggata cttggcaaaa gattcttgct gtcagcatat aaaatgtgct 1440 

134 tgtcatttgt atcaattgac ctttccccaa atcatgcagt attgagttat gacttgttaa 1500 

135 atctattccc atgccagaat cttatcaata cataagaaat ttaggaagat taggtgccaa 1560 

136 aatacccagc acaatacttg tatattttta gtaccataca gaagtaaaat cccaggaact 1620 

137 atgaacacta gaccttatgt ggtttattcc ttcagtcatt tcaaacattg aaagtagggc 1680 

138 ctacatggtt atttggctgc tcactttatg tttacatctc ccacattcat accaatatac 1740 

139 gtcaggtttg gttaaccatt gatttttttt tttttttacc aagtcttaca gtgattattt 1800 

140 tacgtgtttc catgtatctc actttgtgct gtattaaaaa aacctccatt ttgaaaatct 1860 

141 acgttgtaca gaagcacatg tctttaatgt cttcagacaa aaaagcctta cattaattta 1920 

142 atgtttgcac tctgaggtgc aacttaacag ggagggcctg agaaaagaat gggagggggc 1980 

143 tattaattat ttttagcaaa atgttgcctt tgtcttgtgc aaacatgtag aatatgctct 2040 

144 ttaatttagt aaaatatttt tttaaaaggt agagatgctt tgttattgta atcataaact 2100 

145 tcctgaaatt cttgtaattt ttttcccata cttatcagaa gtgtgtttac caacttattc 2160 

146 ttgtttgaaa gtgtgatttt ttttttcctt cccaacctct cttgcaaaaa aagaaatggg 2220 

147 tttctgctaa tgaattgagc agacatctaa tattttatat gccttttgga gctgggtaac 2280 

148 ttaatatttg gatacttgac aatttgtttt attatgtaat tgataaaatg gtgatgtgta 2340 

149 ttaatgttag ttcaaccata tatttatact gtctggggat gtgtggttat agttctgtgg 2400 

150 gagaaataat tttgtcagtg ttcaccagct tgtaaaaact tagtgcgaga gctgaaacat 2460 

151 ctaaataaat aatgacatgc atttatcatc attgagattg gtttgcttaa aattaactta 2520 

152 ttttgtagaa gacaaaatga attgcacttc acttaatgtg tgtcctcatc tttttacaaa 2580 

153 taaatgaagg attataaatg atgtcagcat tttagtaaac ttatagacaa aatttgttag 2640 

154 ggtcattcat gaaaacttta atactaaaag cactttccat tatatacttt ttaaaggtct 2 700 

155 agataatttt gaaccaattt attattgtgt actgaggaga aataatgtat agtagaggac 2760 

156 agccttggtt tgtaaagctc agctccacta gttcatggtt tggtgcaact tctgagcctc 2820 

157 agttctctcc tttgcaaatt aataattaca tacctgccta gatttcggaa attaatctaa 2880 

158 atattagtat ctggctacat gatggccatg tcaagt 2916 

161 <210> SEQ ID NO: 4 

162 <211> LENGTH: 173 

163 <212> TYPE: PRT 
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RAW SEQUENCE LISTING DATE : 07/07/2005 

PATENT APPLICATION: US/10/509,773 TIME: 13:28:01 

Input Set : D:\seqlist.txt 

Output Set: N:\CRF4\07072005\J509773.raw 

164 <213> ORGANISM: Homo sapiens 

166 <220> FEATURE: 

167 <221> NAME/KEY: UNSURE 

168 <222> LOCATION: (0)...(0) 

169 <223> OTHER INFORMATION: PTP4A1 polypeptide sequence 
171 <400> SEQUENCE: 4 



172 


Met 


ala 


Arg 


Met 


Asn 


Arg 


Pro 


Ala 


Pro Val 


Glu 


Val 


Thr 


Tyr 


Lys 


Asn 


173 


1 








5 








10 










15 




174 


Met 


Arg 


Phe 


Leu 


He 


Thr 


His 


Asn 


Pro Thr 


Asn 


Ala 


Thr 


Leu 


Asn 


Lys 


175 








20 










25 








30 






176 


Phe 


He 


Glu 


Glu 


Leu 


Lys 


Lys 


Tyr 


Gly Val 


Thr 


Thr 


He 


Val 


Arg 


Val 


177 






35 










40 








45 








178 


Cys 


Glu 


Ala 


Thr 


Tyr 


Asp 


Thr 


Thr 


Leu Val 


Glu 


Lys 


Glu 


Gly 


He 


His 


179 




50 










55 








60 










180 


Val 


Leu 


Asp 


Trp 


Pro 


Phe 


Asp 


Asp 


Gly Ala 


Pro 


Pro 


Ser 


Asn 


Gin 


He 


181 


65 










70 








75 










80 


182 


Val 


Asp 


Asp 


Trp 


Leu 


Ser 


Leu 


Val 


Lys He 


Lys 


Phe 


Arg 


Glu 


Glu 


Pro 


183 










85 








90 










95 




184 


Gly 


Cys 


Cys 


He 


Ala 


Val 


His 


Cys 


Val Ala 


Gly 


Leu 


Gly 


Arg 


Ala 


Pro 


185 








100 










105 








110 






186 


Val 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


He 


Glu Gly 


Gly 


Met 


Lys 


Tyr 


Glu 


Asp 


187 






115 










120 








125 








188 


Ala 


Val 


Gin 


Phe 


He 


Arg Gin 


Lys 


Arg Arg 


Gly 


Ala 


Phe 


Asn 


Ser 


Lys 


189 




130 










135 








140 










190 


Gin 


Leu 


Leu 


Tyr 


Leu 


Glu 


Lys 


Tyr 


Arg Pro 


Lys 


Met 


Arg 


Leu 


Arg 


Phe 


191 


145 










150 








155 










160 


192 


Lys 


Asp 


Ser 


Asn 


Gly 


His 


Arg 


Asn 


Asn Cys 


Cys 


He 


Gin 








193 










165 








170 















196 <210> SEQ ID NO: 5 

197 <211> LENGTH: 2759 

198 <212> TYPE: DNA 

199 <213> ORGANISM: Homo sapiens 
2 01 <220> FEATURE: 

2 02 <221> NAME/KEY: misc_feature 

203 <222> LOCATION: (0)...(0) 

204 <223> OTHER INFORMATION: PTPN7 polynucleotide sequence 
206 <400> SEQUENCE: 5 

2 07 ggcacgaggc aagaggcagc ctgggggcca cagctgcttc agcagacctc atggctgagt 6 0 
208 gagcctcccc tgggcccagc accccacctc agcatggtcc aagccatggg gggcgctcca 120 
2 09 gagcacagcc gttgaccttg tctttggggg cagccatgac ccagcctccg cctgaaaaaa 180 

210 cgccagccaa gaagcatgtg cgactgcagg agaggcgggg ctccaatgtg gctctgatgc 240 

211 tggacgttcg gtccctgggg gccgtagaac ccatctgctc tgtgaacaca ccccgggagg 3 00 

212 tcaccctaca ctttctgcgc actgctggac acccccttac ccgctgggcc cttcagcgcc 360 

213 agccacccag ccccaagcaa ctggaagaag aattcttgaa gatcccttca aactttgtca 420 

214 gccccgaaga cctggacatc cctggccacg cctccaagga ccgatacaag accatcttgc 480 

215 caaatcccca gagccgtgtc tgtctaggcc gggcacagag ccaggaggac ggagattaca 540 

216 tcaatgccaa ctacatccga ggctatgacg ggaaggagaa ggtctacatt gccacccagg 600 

217 gccccatgcc caacactgtg tcggacttct gggagatggt gtggcaagag gaagtgtccc 660 

218 tcattgtcat gctcactcag ctccgagagg gcaaggagaa atgtgtccac tactggccca 720 
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219 cagaagagga aacctatgga cccttccaga tccgcatcca ggacatgaaa gagtgcccag 780 

220 aatacactgt gcggcacgtc accatccagt accaggaaga gcgccggtca gtaaagcaca 840 

221 tcctcttttc ggcctggcca gaccatcaga caccagaatc agctgggccc ctgctgcgcc 900 

222 tagtggcaga ggtggaggag agcccggaga cagccgccca ccccgggcct atcgtagtcc 960 

223 actgcagtgc agggattggc cggacgggct gcttcatcgc cacgcgaatt ggctgtcaac 1020 

224 agctgaaagc ccgaggagaa gtggacattc tgggtattgt gtgccaactg cggctagaca 1080 

225 gaggggggat gatccagacg gcagagcagt accagttcct gcaccacact ttggccctgt 1140 

226 atgcaggcca gctgcctgag gaacccagcc cctgacccct gccaccctcc ggtggcccag 1200 

227 gtgcctacct ccctcaagcc tgggaaggtg ggtctgggga aagtgggccg agtgatctgg 1260 

228 gggtaccctt gggttggtgt ggggaaggag tgcctcctta gtggtgcttg acagtcacag 132 0 

229 gaagcagcag cagtaaggac aaggggccgg attcaggtct tcaaccactg gccactcctc 1380 

230 ttgccttcct ctgttggccc cagatggaca gtaaggggaa cctccaatgt ctctctgaac 1440 

231 ttaaagacag gagctggcat ttatgacaga caaagaaaga agcccaggtg tcctggtgtt 1500 

232 ctctgagaca ctctttgtga tcttcagttt cctgttctat aacatgaaca taagtgctta 1560 

233 gctgccatga gggaaaagta atgagagaag ttctagaagc cactccagcc actccttcct 162 0 

234 ggggctgaca aaagggtgat tccaagatca tccttcaccc gaggtcctgc ccaagcacag 1680 

235 gccagatgca agaatgggga aaagtctggt cctgatctcc aagtctcaac atcctatcag 1740 

236 tgactctgcc tccctgacca cacatcggaa gggcctggat gacccaatca aaagaaagaa 1800 

237 caaggactct ggttaccctt gcctccaccc atgtgtcata agagtaggct acagaggtga 1860 

238 ccaggcctgg cagttgaaat ctctggaaga gggaacatgt ggggactact cagaggcaaa 1920 

239 gaggagctgc tcctgcctcc atggttgctg gccactccca ccaactactc ttagggaggc 1980 

240 taagcagtct ctgttttgac cttccatggc tcaataatac ctggatgcag gaccactata 2040 

241 ccttgcattt gctgagtaca cctagagagc ttggctgttt ccaaaaacaa tcagggtcat 2100 

242 aaccatccat gcagacatgg aggctcggct gaaccaggac tcctcactgt ctacctgaga 2160 

243 gaatgagcac ccctcatcca tctcagcatc aacacaattt ccaggggacc tcaggtctac 2220 

244 ctcaggactg aaccgccaca cctcaggatt cctcctcctt gaatctgaga ctggctgccc 2280 

245 attctgagat ggggatgaag gtaagatgcc gcatcaccag cacgccgccc ctgacagctg 2340 

246 ccttgatacc agctctctgt ggaaaccccc gaggagttgg atctggagaa cagctgggcc 2400 

247 tcctcactca ggacttctct cctgaagaac acgcagtgct aaaactgagg atgatttccc 2460 

248 taatgcttct gcttggagtc tcttatggag gagctgctcc ttccttacag cttggggatg 2520 

249 gacttcccac acctccacct cccctgagcc ctgagccctg tgagaggacg actgtctatg 2580 

250 caatgaggct cggtgggggg ctctcaagtg cctgatcctg cctggctcag aggcagccag 2640 

251 agggaagcaa ctgacagccc cacaggccct ccctggcact gtccccatct cagagctcag 2700 
2 52 gagggtacaa gctccagaac agtaaccaag tgggaaaata aagacttctt ggatgactg 275 9 

255 <210> SEQ ID NO: 6 

256 <211> LENGTH: 339 

257 <212> TYPE: PRT 

258 <213> ORGANISM: Homo sapiens 

260 <220> FEATURE: 

261 <221> NAME/KEY: UNSURE 

262 <222> LOCATION: (0) . . . (0) 

263 <223> OTHER INFORMATION: PTPN7 polypeptide sequence 

265 <400> SEQUENCE: 6 

266 Met Thr Gin Pro Pro Pro Glu Lys Thr Pro Ala Lys Lys His Val Arg 

267 15 10 15 

268 Leu Gin Glu Arg Arg Gly Ser Asn Val Ala Leu Met Leu Asp Val Arg 

269 20 25 30 

2 70 Ser Leu Gly Ala Val Glu Pro lie Cys Ser Val Asn Thr Pro Arg Glu 
271 35 40 45 
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